Techniques for estimating vocal-tract shapes from the speech signal
نویسندگان
چکیده
This paper reviews methods for mapping from the acoustical properties of a speech signal to the geometry of the vocal tract that generated the signal. Such mapping techniques are studied for their potential application in speech synthesis, coding, and recognition. Mathematically, the estimation of the vocal tract shape from its output speech is a so-called inverse problem, where the direct problem is the synthesis of speech from a given time-varying geometry of the vocal tract and glottis. Different mappings are discussed: mapping via articulatory codebooks, mapping by nonlinear regression, mapping by basis functions, and mapping by neural networks. Besides being nonlinear, the acoustic-to-geometry mapping is also nonunique, i.e., more than one tract geometry might produce the same speech spectrum. We will show how this nonuniqueness can be alleviated by imposing continuity constraints.
منابع مشابه
Recovering vocal tract shapes from MFCC parameters
Recovering vocal tract shapes from the speech signal is a well known inversion problem of transformation from the articulatory system to speech acoustics. Most of the studies on this problem in the past have been focused on vowels. There have not been general methods e ective for recovering the vocal tract shapes from the speech signal for all classes of speech sounds. In this paper we describe...
متن کاملEstimation of vocal-tract shape from speech spectrum and speech resynthesis based on a generative model
Precise control of articulatory parameters is difficult and prevents a physical model from generating natural sounding speech signals. To determine vocal-tract shape from speech, this paper presents an inversion method for simultaneously estimating the cross-sectional area and length of the vocal tract. In addition, we performed speech resynthesis from a time-series of estimated vocal-tract sha...
متن کاملEstimating the vocal-tract area function and the derivative of the glottal wave from a speech signal
We present a new method for estimating the vocal-tract area functions from speech signals. First, we point out and correct a long-standing sign error in some literature related to the derivation of the acoustic reflection coefficients of the vocal tract from a speech signal. Next, to eliminate the influence of the glottal wave on the estimation of the vocal-tract filter, we estimate the vocal-t...
متن کاملAn empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping
Articulatory inversion is the problem of recovering the sequence of vocal tract shapes that produce a given acoustic speech signal. Traditionally, its difficulty has been attributed to nonuniqueness of the inverse mapping, where different vocal tract shapes can produce the same acoustics. However, evidence for the nonuniqueness has been restricted to theoretical studies, or to data from atypica...
متن کاملValidation of Optimum Algorithm Parameters Required to Estimate Vocal Tract Shape for Children Using LPC Analysis
Severe or profound deafness in hearing impaired children, can curb their ability to speak due to the lack of auditory feedback. There has been a considerable attempt in developing commercial speech training aids for such children which give feedback of acoustic and articulatory parameters. Speech training aids based on visual feedback of vocal tract shape (VTS) are reported to be useful for the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEEE Trans. Speech and Audio Processing
دوره 2 شماره
صفحات -
تاریخ انتشار 1994